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Abstract 

We introduce a new smooth estimator of the ROC curve based on log-concave 
density estimates of the constituent distributions. We show that our estimate 
is asymptotically equivalent to the empirical ROC curve if the underlying densi- 
ties are in fact log-concave. In addition, we empirically show that our proposed 
estimator exhibits an efficiency gain for finite sample sizes with respect to the 
standard empirical estimate in various scenarios and that it is only slightly less 
efficient, if at all, compared to the fully parametric binormal estimate in case the 
underlying distributions are normal. The estimator is also quite robust against 
modest deviations from the log-concavity assumption. We show that bootstrap 
confidence intervals for the value of the ROC curve at a fixed false positive frac- 
tio n based on the new e stimate are on average shorter compared to the approach 



bv lZhou and QinI (j2005l ). while maintaining coverage probability. Computation of 
our proposed estimate uses the R package logcondens that implements univariate 
log-concave density estimation and can be done very efficiently using only one line 
of code. These obtained results lead us to advocate our estimate for a wide range 
of scenarios. 

Keywords: Diagnostic test; Log-concave density estimation; Nonparametric estimation; 
Receiver operating characteristic curve; Sensitivity and specificity 



1 Introduction 

The receiver operating characteristic (ROC) curve is a common way of assessing the 
diagnostic accuracy of a diagnostic test with continuous outcome Z that predicts pres- 
ence or absence of a binary trait, typically a disease. The ROC curve is defined as the 
plot of the sensitivity (the true positive fraction) against one minus specificity (the false 
positive fraction) across all possible choices of threshold values 6. Thus, the ROC curve 
displays the range of possible trade-offs between the true and false positive rates. To 
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fix ideas, let F and G denote distribution functions of the test result X and K of a 
non-diseased and a diseased subject, respectively. If F and G are absolutely continuous, 
the ROC curve of the test can be expressed as 



R{t- F, G) 



l-G{F-\l-t)) 



(1) 



where F~^ is the quantile function of F, and t G (0, 1) is the false positive fraction 
corresponding to a cut-off for positivity of the test. 

We suppose that a sample Xi, . . . ,Xm ~ F from the non-diseased population and a 
sample Yi, . . . ,Yn ~ G from the diseased population are available. We further assume 
that all these observations are mutually independent and denote the empirical distribu- 
tion functions based on the samples by and respectively. The empirical quantile 
function for a sample Xi, . . . , X^ is defined as ¥~^{p) = Xi if F(Xj_i) < p < F(Xj) for 
any p e [0,1] and by setting Xq = — oo, Xn+i = oo. In the absence of any further infor- 
mation about F and G, plugging in these empirical distribution functions in ([1]) yields 
the nonparametric maximum likelihood estimator Rm,nit) = R{t; ¥m, ^n) of R{t; F, G), 
an increasing step function that simply consists of plotting the empirical proportions 
i^{Yj > t}/n vs. i^{Xi > t}/m for varying t. Strong consisten c y and strong approx- 
imation properties of Rm,n are provided in iHsieh and Turnbulll (119961 ) . Being a fully 
nonparametric estimator, Rm,n displays the data pattern well but may suffer from sub- 
stantial variability and, as a step function, is not smooth. Moreover, due to the rugged 
form, an empirical ROC curve can have the same true positive fraction corresponding to 
a range of false positive fractions. This is inconvenient when one is interested in finding 
a particular false positive fraction at a specified true positive fraction. 
In applications, it seems sensible to assume that the true underlying ROC curve R is 
in fact sr nooth. Thus we expect an estimator to perform better if some smoothness is 
imposed (ILloyd and Yongj . Il999l Section 1). The most obvious way of doing so is to 
assume a parametric model, estimate the distribution functions F and G on the basis 
of the samples of healthy and diseased subjects, and compute the corresponding ROC 
curve. Assuming a normal distribut ion both in the control and in the cases group is the 
most prominent parametric model fjPepd . l2003l . Example 5.3). 

Among the frequentist approaches that are between the two "extreme cases", the entirely 
nonparametric and the parametric ROC curve estimate, are semiparametric models. 
Such models do not assume a para metric form for the constituent distributions but 
rather for the ROC curve directly (jCai and Moskowitzl . l2004l ). The most prominent 
semiparametric model for the ROC curve stipulates the existence of a function h such 
that h{X) ~ A^(0, 1) and h(Y) ~ N{fi, cr^). The binormal ROC curve is then given by 



(2) 



where a and b are to be estim ated and $ is the cumula tive distribution (CDF) of a 
standard normal distribu tion. iHsieh and Turnbulll fll996l) propose a generaliz ed least 
squares estimate whereas Cai and Moskowitzl ( 2004 ) and Zhou and Lin ( 2008! ) discuss 
profile likelihood methods to estimate a and b in the general setup ([2]). 
With the purpo s e of d efining a smooth, though more flexible than parametric, ROC curve 
estimate iLlovdl fll998l ) proposes to plug in kernel estimates of F and G into ([T]). As is 
common for all kernel estimates, a kernel and, more importantly, a bandwidth have to be 
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chosen. In the ROC curve estimation problem this issue is even more pronounced, since 
both the ordinate and coordinate of the estimated ROC curve ar e random and it is thus 
unclear which variation to minimize in mean square. iLloydl ( 119981 ) suggests to circumvent 
the bandwidth choice problem by a rather involved graphical examination of bias and 
variation from bootstrap samples. That this kern el ROC estimate i s mor e efficient than 
the empirical is shown in the subsequent paper iLlovd and Yond f|l999[). However, a s 
discussed in the bandwidth comparison study performed by lZhou and HarezlakI (120021 ). 
Lloyd's estimate may not be practical for routine use in medical studies, although it 
has good theoretical properties. Its mai n draw backs is that bandwidth selection is 
ad- hoc and difficult. iHall and HvndmanI ( l2003l ) take a somewhat different approach 
as they choose the optimal bandwidth not separately for F and G, but they derive 
optimal bandwidths via minimizing the mean integrated squared error (MISE) for the 
estimation of R directly. They show via simulations that their estimator improves on 
existing approaches in terms of MISE. 

Another approach to smooth estimation is taken in lOiu and Ld (120011) who co mbine 
kernel estimation of G with a quantile estimator due to iHarrell and DavisI ( 1l982f ). The 
latter has been sh own to perforin bett er than the usual empirical quantile estimator 
in many scenarios. iDu and Tang ( 120091 ) propose a monotone spline approach to ensure 
monotonicity and transformation invariance o f the estimated ROC curve. However, here 
one has to select a smoothing parameter and IDu and Tang (120091 ) propose to use cross 
validation to guide that choice. 

Further pape r s that deal with semip arame tric approaches to estimate a ROC curve are 



Llovdl ( 12002[ ). iPeng and Zhoul ( 12004 ). and IWan and Zhangi (120071 ). However, it seems 



safe to say that diagnostic tests in applic ations are to a large exteri t assessed by using 
the empirical ROC curve estimate Rm,n ( IZhou and HarezlakI . I2002L Section 1) and to 
a lesser extent the binormal estimate, despite the well-known deficiencies of the latter. 
Presumably, this has to be attributed to lack of easy accessible software that implements 
alternative methods. 

In Section [2l we briefly summarize some facts on log-concave density estimation. Our 
new estimator is introduced in Section |3l and some theoretical results are provided in Sec- 
tion m An illustration of the proposed ROC curve estimate using a real-world dataset 
is discussed in Section |5l and its performance in simulations is assessed in Section [G] 
Bootstrap confidence intervals for the value of the ROC curve at a fixed false positive 
fraction t are introduced in Section [71 and Section |8l discusses the features of the new 
estimator under misspecification. Finally, in Section [9l we draw some conclusions. Addi- 
tional results on log-concave density estimation, on estimation of AUC, on simulations 
and under misspecification as well as proofs are postponed to the appendix. 



2 Log-concave density estimation 



In Sec tion [31 we propose an alternative ROC curve estimator in the spirit of iLlovd 
( 119981 ). Namely, we also model the constituent distributions F and G nonparametrically. 
However, we do not suggest t o use kerne l s, but t he log-con c ave d ensity estimate ii iitiall y 
introduced in IWaltheij (12002[ ). iRufibachI (120061 ). |Pal et all ( 120071 ). and iRufibachI (l2007f ). 
The features of the new ROC curve estimator are discussed in Section [31 More details 
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on recent research on log-concave density estimation is provided in the appendix. 
Rate of uniform co nvergence for the univar i ate Iq g-concave maximum hkehhood esti- 
mate was derived in iDiimbeen and RufibachI (120091) and its computation is described in 

RufibachI (120071 ) , iDiimbgen et al.l (120101 ) , and lDiimbgen and RufibachI (120111). The corre- 

spon ding software is available from CRAN as the R package logconden s (jRufibach and Diimbgen 
201ll ). The multivariate ca se has been treate d bv ICule et al.l ( 120101 ) and the extension 



to discrete observations in iBalabdaoui et al.l ( 120111 ). For a recent review of all these 



developments we refer to IWaltherl (120091 ). 

To fix ideas, let p be a probability density on M. We call p log-concave if it may be 
written as p{x) = exp(p{x) for some concave function (/? : M — t- [—00,00) and x G M. 
Based on a sample of i.i.d. random variables Vi, . . . , G M from p we seek to estimate 
this density via maximizing the normalized log-likelihood function 



n 



1=1 



i=l 



over all concave functions : M — )■ [—00, 00) such that / exp(p{x) dx = 1. 
T he merits of impos i ng log - concavity as a shape constraint ha ve be en described in detail 



m 



Ba: 



abdaoui et aP ( l2009h . lc"ule et al.l dicKfflh . lWaltherl (l2009l ). and lPiimbgen and Rufibacii 



( I2OIII ). The most relevant of those properties in the current context are that many para- 
metric models have log-concave densities. Examples include the Normal, Exponential, 
Uniform, Gamma(r, A) for r > 1, Beta(a, 6) for a,b > 1, generalized Pareto if the tail 
index 7 is in [—1, 0], Gumbel, Frechet, Logistic or Laplace, to mention only some of these 
models. In addition, log-concavity can be considered a straightforw ard generalization of 



normality that shares many properties of the Normal distribution ( ISchuhmacher et al. 



2OIII ). but is much more flexible. For these reasons, assuming log-concavity offers a flex- 
ible nonparametric alternative to purely parametric models that includes a wide range 
of asymmetric, unimodal densities. 

The crucial feature of pn, the log-concave density estimator and maximiizer of i{(f), for 
our intended application is that the corresponding CDF estimator Pn is, under some 
regularity conditions and if the true density is in fact log-concave, asymptotically equiv- 
alent to P„. As a consequence, P„ can be regarded as a smoother of the empirical 
distribution function P„ for finite sample sizes n and this will directly translate into a 
smooth estimator fo r the ROC curve. Apart f rom t he asymptotic result in Theorem I A. II 
given the appendix, iDiimbgen and RufibachI (120091 Corollary 2.5) have also derived ad- 
ditional features of P„ for finite n that further support the plug-in strategy indicated 
above. Namely, that P„(V^i) = 0,P„(K) = 1, and that P^ix) G [Pn(x) -n-\P„(a;)] 
whenever x is a knot point of the piecewise linear function ip^- 

To conclude the discussion of log-concave density estimation we would like to men tion the 
kernel smoothed versi on of p„ introduced in IDiimbgen and RufibachI ( l2009l . Section 
3) and generalized in IChen and SamworthI (12011'). Closed formulas can be der ived to 
compute p* and the corresponding CDF P*, see IDiimbgen and RufibachI (120 111 ). Since 
log-concavity is preserved under convolution, the smoothed estimate p* r e mains log- 
concave if the applied kernel has this property. In IDiimbgen and RufibachI ( 120091 ) . the 
normal kernel was used with a bandwidth chosen such that the variance of p* equals 
that of the sample Vi, . . . , Vn- As for p„, this makes p* a fully automatic estimator. 
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Note that p* is rather close to p„, see IChen and SamworthI (1201 ll . Proposition 3) for a 
corresponding quantitative result. 



Testing for log-concavity. Typically, shape constraints are motivated through sub- 
stantive considerations as above. However, resea rchers may want to formally assess 



the hypothesis o f log-concavity. iHazeltonl (120111 ) adapts Silverman's bandwidth test 



(ISilvermanl . Il982[ ) to test log-concavity of densities in any dimension and shows that it 
has healthy power in several scenarios. The test works by constructing a kernel esti- 
mate of the target density in which log-concavity is enforced by the use of a sufficiently 
large bandwidth. The test statistic is the minimal bandwidth for which log-concavity is 
achieved and the null distribution is generated via bootstrap. 

In a more exploratory manner, log-concavity can be visually assessed by comparing such 
an estimate to, e.g., a kernel estimate of the same data. 



Illustration of the log-concave density estimates. To illustrate the log-concave 
estimate, we show in Figure [1] density estimates applied to the log of the carbohy- 
drate antigen 19-9 measurements from the pancreatic cancer serum biomarker study, 
for the pancreatitis patients (i.e. controls) only. This data was init ially analyzed in 
Wieand et al.1 (119891) and re-assessed, among others, i n iPepd (120031. Example 1.3.3) , 
Cai and Moskowitzl t004 . Section 6), IWan and Zhangl (l2007h . or Izhou and Linl (l2008h 
to name a few. For details on the dataset we refer to Section [51 In the left plot of Fig- 
ure [H the two log-concave density estimates as well as the standard R kernel estimate 
are displayed. I t seems safe to assume a log-concave density for this data - the test by 
HazeltonI (120111 ) yields a p- value of p = 0.84 (based on 9999 bootstrap samples). Note 
that the log-concave estimate is able to capture the data's pronounced skewness. The 
fact that the log of the estimated density is piecewise linear, see Section A in the ap- 
pendix, can be seen through the potential sharp kinks in the density, as it is the case in 
Figure [H It is clear that such kinks are alleviated by the smoothed version p* . However, 
when looking at the CDFs on the right side of Figure [H we see that the seemingly "nicer" 
fit of p* on the density level does not imply a significant difference for the estimated 
CDFs. The right picture further emphasizes that, as claimed above, the log-concave 
CDF estimator (whether kernel smoothed or not) can be considered to be a smoother 
version of the empirical CDF, a fact theoretically supported by Theorem lA.ll 



3 A new ROC curve estimator 

To define our new ROC curve estimator we first compute log-concave distribution func- 
tion estimates Fm and Gn based on the samples Xi, . . . , and Yi, . . . , The esti- 
mates are then plugged into ([1]) to get 

Rm.,nif) '■= R(t;Frn,Gn) = 1 — (^^(-F"^ (1 — t)) 

for t G (0, 1). The corresponding estimate based on the CDFs derived from the kernel 
smoothed log-concave estimates is denoted by R^ni't)- Being a function of the integral 
of a piecewise exponential function, both estimators are smooth, i.e. they are at least 
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Figure 1: Pancreas dataset: Log-concave density estimation for log of CA 19.9 measure- 
ments, controls only. 



different iable on the entire domain (0, 1) and infinitely often differentiable except on the 
joint set of knot points of frn and 

In Section A of the appendix we recapitulate asymptotic results for pn and P^. These 
results can be used to show that Rm^n, as a process in t, is asymptotically equivalent to 
Rm,n if the log-concavity assu mption holds, see Se c tion [H This implies that the limiting 
behavior derived for Rm,n in Hsieh and TurnbuU f 1996[ ) in some sense applies to Rm,n 



or that asymptotic confidence intervals derived for Rm,n(t) are also valid for Rm,n{t). In 
addition, Rm,n is smooth and, as we show in Section [6l for finite m and n typically more 
efficient than Rm,n- We therefore advocate the use of our new estimator as a surrogate 
for Rm.,n and an alternative to the kernel estimate Rmn when it is safe to assume that 
the constituent densities are log-concav e. 

It was shown in iLloyd and Yong] (119991 ) that the estimated ROC curve based on kernels 
outperforms the empirical, just as Rm,n- One of the main advantages of shape constraint 
estimation in general, and in our current setting in particular, is that such estimates are 
fully automatic, i.e. they do not necessitate the choice of a kernel, bandwidth, or some 
other regularization parameter whose optimal value typically depends on the unknown 
function to be estimated. 

Admittedly, and as it is typical for semiparametric ROC curve estimates, Rm,n is gen- 
erally not invariant with respect to monotone transformations of either X and/or Y. 
However, this is the case for virtually all parametric models. A ROC curve R (true or 
estimated) is biased if there exists p E (0, 1) such that R{p) < p. Log-concave ROC 
curve estimates can indeed be biased. However, the bias is in general constrained to 
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regionsof (0, 1) that are small, and typically smaller compared to the binormal model. 
See iPepd (120031 p. 83) for a detailed discussion of potential bias in binormal models. 
Compared to a parametric model with Normal distributions for F and G, the log-concave 
approach is certainly more flexible, at the cost of only a small reduction in efficiency in 
case F and G are in fact Gaussian, as we verify in our simulation study in Section [61 



4 Main result 



In the sequel, a function g is said to belong to the Holder class 'H^'^{I) of functions 
with exponent /3 G (1, 2] and constant L > if for all x,y & I we have \g'{x) — g'{y)\ < 
L\x — y\^~^- The claim of asymptotic equivalence of Rm,n and Rm,n under log-concavity 
is based on Theorem 14.11 

Theorem 4.1. Assume that 

CI F is supported on Ip = [0, 1] with positive density F' = f , and G has (potentially 
infinite) support Iq ^ IR- 

C2 F and G both have log-concave densities f = exp(p,g = exp7 where (p,j are 
both Holder- continuous with exponent /3 G (1,2] and constant L > on Ip and 
[A, B] C Ig, respectively. 

C3 F and G have log-densities such that (p'{x) — (p'{y) > Gp{x — y) for < x < y < 1 
and 7'(x) — 7'(?/) > Gg{x — y) for A < x < y < B and two constants Gp, Gq > 0. 

C4 The sample size m is a function of n such that n/m ^ X > asn— t-oo. 

Then, as n ^ oo, 

for J = [F{A + 5), F{B - 6)] for every 6 > 0. 

A strong approximation result for the empiric al ROC curv e process wa s provided in 
Hsieh and TurnbuUl ( 1996 . Theorem 2.2), see also Pepe ( 2003 , Result 5.2) or Horvath et al, 
( 120081 . Section 2), and implies the following corollary: 



Corollary 4.2. Let Mi{t) and M2{t) be two independent Brownian Bridges. Then, as 



n 



oo. 



^{R^,n{t) - R{t)) ^d. Bl(l-i?(t) 



/(F-i(l-i)) ^ 



uniformly on J. 



Note that the original result in iHsieh and Turnbulll ( 1l996l Theorem 2.2) relating the 
empirical and the true ROC holds a.s. where Brownian Bridges depending on n are 
involved. This is the reason why we only get convergence in distribution in Corollary 14. 21 
The proof of and some comments on the assumptions of Theorem 14.11 can be found in 
Section D in the appendix. 
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observations 1 - specificity 

Figure 2: Pancreas dataset: Log-concave density estimates for log of CA 19.9 measure- 
ments and induced ROC curves. 



5 Illustration of the new ROC curve estimate 

We illustrate our new estimator and compare it to the empirical Rm,n and binormal 
estimate Rm.,n on the pancreatic cancer serum biomarker dataset. The study examined 
two biomarkers, an antigenic determinant denoted CA125 and a carbohydrate antigen 
denoted as CA19.9. The data consists of 51 controls that did not have cancer but 
suffered from pancreatitis and 90 pancreatic cancer patients. Here, we use the fully 
parametric binormal model, i.e. we estimate a and 6 in (|2]) directly from the mean n 
and variance cr^ of the underlying distributions. The results are displayed in Figure [2] and 
we observe that (1) Given that the empirical estimate follows the true curve closely, the 
binormal model does not seem to be appropriate here, (2) Both log-concave estimates 
are very close to the empirical estimate, acting as smoothers of the empirical estimator 
as discussed above, (3) The difference between the log-concave and the smoothed log- 
concave estimate in estimation of the ROC curve is small and mainly concentrated at the 
boundaries of the domain of the ROC curve. This is not surprising as the corresponding 
CDF estimates exhibit the same behavior as shown in Figure [1] This can be explained 
by the fact that the discontinuities of Pn at the smallest and largest order s tatistic of 



the sa mple under consideration are smoothed out by p*. Finally, the test bv iHazelton 



(120111 ) yields a p- value to assess the null hypothesis of log-concavity of p = 0.84 for the 



controls and p = 0.83 for the cases. 
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6 Simulations 



Having shown asymptotic equivalence to the empirical estimate, it remains to empirically 
verify that, apart from its inherent properties such as smoothness and no need to choose 
any regularization parameters such as kernel and bandwidth, we additionally enjoy a gain 
in efficiency by using Rm,n instead of Rm,n- To this end, we have performed a simulation 
study for the scenarios provided in Table [1] Scenarios 1-3 serve as a benchmark for 
comparing our estimators to the binormal model. In addition, Scenar io 2 has been 
analyzed in lLlovd and YongI ( 1l999l ) and Scenario 3 in lZhou and Lid ( 120081 ). what enables 
a direct comparison to these estimators. Scenario 4 is intended to evaluate the effect 
of only one distribution being non-normal but skewed. This seems a plausible setup 
in ROC applications where values for the controls may be normal but those of cases 
are right-skewed. Scenario 5 assesses the performance in case both distributions are 
right-skewed. Finally, Scenario 6 evaluates the methods for symmetric but non-normal 
distributions. 



Table 1: Scenarios we simulated from. Ga(a, Gamma distribution with shape 
parameter a and rate parameter /3. Log(/i, s): Logistic distribution with location pa- 
rameter n and shape parameter s. 



Scenario 


F 


m 


G 


n 


has been used in 


1 


N(0, 1) 


20 


N(l, 1) 


20 




2 


N(0, 1) 


100 


N(l, 1) 


100 


Llovd and Yone (1999. Fiaure 1) 


3 


N(0, 1) 


100 


N(2, 1.2) 


100 


Zhou and Lin (2008. Fieure 1. left v\ot) 


4 


N(2, 1) 


100 


Ga(2, 1) 


100 




5 


Ga(2, 1) 


100 


Ga(4, 1.5) 


100 




6 


Log(0, 1) 


100 


Log(2, 1) 


100 





In our simulations, we compare the empirical ROC estimate Rm.n, the ROC estimates 
based on the two variants of log-concave density estimate (maximum likelihood and its 
smoothed version), the k ernel estimate -R^ „ with o ptimal bandwidths to estimate the 



ROC curve introduced in lHall and Hyndmaru ( l2003l ). and the fully parametric binormal 



model. The latter model is well-specified only if both F and G have a normal density. 
We would like to emphasize that the binormal model was chosen as a competitor because 
it serves as a parametric benchmark if in fact F and G indeed have a normal density and 
our simulations show, that Rm,n and i?^,n are only slightly worse than this benchmark. 
However, we generally discourage the use of binormal models in practice. 
To evaluate the performance of our competitors we use the average square error (ASE), 
defined for a ROC curve estimate -R of a true ROC curve R as 

^grid 



ASE{R) = ng-i,5^(i?K)-i?K; 



k=l 



for grid points Ui,i = 1, . . . , rigrid- This criterion has been used in IZhou and Lid (120081 ) 
to evaluate different variants of binormal ROC curve estimates. Following the same ap- 
proach as in the latter paper we choose the Uj's to be equidistant on [0, 1], rignd = 100 and 
we generated M = 500 samples for each scenario. Using the same criterion and setting 



9 



all the simulation parameters to the values already used in (jZhou and Liru . l2008l left plot 
of Figure 1) enables direct comparison of our Scenario 3 to their results and we provide 
the analogous plot in Figure |H In Figure |3] we display the results for all the six scenarios 
from Table [1], reported as follows: For each simulation run, the ASE is computed for the 
four estimators and {ASE[R)j/ASE{Rm^n)jY^'^ for R G {Rm^n, Rm Bliu ^m^n} and 
j = 1, . . . , M are reported. Thus, each of the four estimates is benchmarked with respect 
to the empirical ROC curve Rm,n- Figure [3] allows for the following observations: In the 
purely normal setting (Scenarios 1-3), the log-concave estimators are generally more 
efficient than the empirical, to an even remarkable extent for Scenario 3. We attribute 
this to the fact that here, the ROC curve is rather steep for small values on the abscissa, 
a shape that Rm,n and R^n ^t)le to more efficiently capture than the empirical. 
The kernel estimator R^^ outperforms the empirical to an extent comparable to the 
log-concave estimates. Finally, compared to the fully parametric binormal model the 
loss of efficiency for the log-concave versions and the kernel estimate is notably small. 
W hen "visually ave r aging " the pointwise root mean square reductions over t e (0, 1) 



m 



Llovd and Yond ( 11999 



Figure 1, left plot) we see that their kernel estimate does 
approximately 15% better than the empirical estimate, for data simulated from our 
Scenario 2. This roughly corresponds to our median reduction in that scenario. 
The gain of the log-concave ROC estimates over the empirical estimate is maintained for 
non-normal though still log-concave setups (Scenarios 4-6), whereas in these cases the 
binormal model is certainly misspecified and therefore underperforming. In Scenarios 5 
and 6, Rm,n and K^^^ provide values of {ASEY^'^ with less variability than R^^- 
The effect of smoothing the maximum likelihood density estimate in ROC curve esti- 
mation is for all the analyzed s cenarios rather small . This is to be expected given that 



20 111 . Propositon 3) and the fact 



these two estimates are close fiChen and Samworthl . 
th at they are int e grate d when computing the ROC curve. 

In IZhou and LinI (120081 ) a new estimate for a and h in the semiparar netric normal model 
i s intr oduced and compared to a few competitors, e.g. the one from lCai and Moskowitz 
(120041 ). Note that all these estimators are rather complicated to compute and, to the best 
of our knowledge, no easy ac cessible implementati on for these estimators is available. 
The left plot in Figure 1 of IZhou and LinI (120081 ) reports results for our Scenario 3 
and the analogous result using the esti mators from our simu lation study is displayed 
in Figure |H Although the estimates in IZhou and LinI (|2008| . Figure 1, left plot) were 
explicitly tailored for the binormal model, our log-concave estimates are only slightly, if 
at all, less efficient, which is consistent with the results shown in Figure [31 The same 
applies to R^ „. 



7 Bootstrap confidence intervals 



To construct a confidence interval around Rm,n at a fixed point t G [0, 1], one can, by 
virtue of Theorem 14.11 use standard errors based on asymptotic theory for Rm,n- Asymp- 
totically, such an interval maintains the pre-specified coverage probability, provided that 
the log-concavity assumption holds. However, to exploit the gain in efficiency for finite 
samples in computation of confide nce intervals f or val ues of the true ROC curve, we 
suggest to proceed as sketched in iDu and Tang] ( 120091 ) for a similar spline estimator. 
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Scenario 1 : 20 N(0, 1 ) vs. 20 N(1 , 1 ) 



log- smoothed 
concave log-concave kernel binormal 



o 
O 
cr 
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Scenario 3: 100 N(0, 1) vs. 100 N(2, 1.2) 
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Scenario 5: 100 Ga(2, 1) vs. 100 Ga(4, 1.5) 



log- smoothed 
concave log-concave kernel binormal 



o 
O 
cr 



Scenario 6: 100 Log(0, 1) vs. 100 Log(2, 1) 
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Figure 3: Results of simulation study. Horizontal dashed lines at 0.75, 1. Note the 
different scalings of the y-axis. 
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Scenario 3: 100 N(0, 1) vs. 100 N(2, 1.2) 
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Figure 4: ASE for Scenario 3, to be compared with IZhou and LinI fl2008l . Figure 1, 
plot). Horizontal dashed lines at 0.001, 0.002. 



left 



Namely, draw B bootstrap resamples {X*}^]^ and {Y^ }"^^ from the original data and 
compute for each of these resamples the estimator Rm,n- This yields bootstrapped ver- 
sions -R*„ of the estimated ROC curve. The (1 — a) confidence interval at a point 
t G [0, 1] can then be based on the quantiles of this bootstrap distribution, i.e. we com- 
pute [{R*,nit))a/2,{R*,nit))i-a/2] where (^#„(t))a/2 and (^*„(t))i-a/2 are the (a/2)- 
and (1 — a/2)-quantile of the /2*^(t)'s from the bootstrap samples. 
To assess the performance of these intervals we proceeded as follows: For the scenarios 
in Table [1] we computed the above intervals at t G {0.1,0.3,0.5,0.7,0.9} for B = 500 
and 1 — a = 0.95. As a benchmark we compare the proportion of these intervals 
out of M = 500 simulation runs that cover t he true value R(t) to the same proportion 
computed using the BTII interval proposed in lZhou and Qiru (120051 ). BTII is a bootstrap 
confidence interval that does not only estimate the true positive fraction at a fixed t, 
but also accounts for the uncertainty due to the fact that we estimate the quantile 
corresponding to t based on the controls. IZhou and QinI (120051 ) show via simulations 
that BTII has superior coverage accuracy and shorter interval length compared to other 
approaches over a wide range of scenarios and for sample sizes comparable to those in 
our simulation setup. 

In Figure[5]we present the results. We find that coverag e proportions are basi cally identi- 
cal for BTII and the interval based on Rm,n, for t < 0.7. IZhou and Qiru ( l2005l ) noted that 
the BTII interval performs poorly for values of t that correspond to high (> 0.95) sensi- 
tivities and small sample sizes. Looking at the smallest values of t where the true R{t) > 

0. 95 in our six scenarios from Tabled! we get values of 0.74/0.74/0.49/0.95/0.77/0.72, 

1. e. our results are in line with that rule-of-thumb: for Scenarios 1 and 3, BTII is 
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Coverage proportions for confidence intervais over 500 simuiations 

+: log-concave / o: BTII. Numbers at the bottom indicate Scenario from Table 1 . 
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Figure 5: Performance of confidence intervals for i? at a fixed t. Comparison of boo tstrap 



intervals based on Rm,n and the BTII intervals discussed in IZhou and QinI (l2005[ l . 



performing very poorly for t = 0.9, whereas the interval based on Rm.,n has a better 
performance. Both methods approximately have the same difficulties in reaching the 
prescribed level for t = 0.9 in scenarios 2, 5, 6. However, as can be inferred from the 
lower plot in Figure |5l the intervals based on Rm,n generally yield confidence intervals 
with a shorter average length over the M = 500 simulations, a gain that is in line with 
the simulation results in Section |6l 



8 Performance if / and/or g is not log-concave 



First of all, we would lik e to point to recent work on estimation of a log-concave density 
under misspecification. ICule and SamworthI (120101 ) show that the log-concave density 
estimator fn converges to the density g that minimizes the KuUback-Leibler distance 
to the true density /, in some exponentially weighted total variation norm. If / is 
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indeed log-concave, this implies a strong type of consistency for the log-concave density 
estimator. 

Now, it is by no means immediate how this generalizes to our ROC curve estimate Rm,n- 
To assess the robustness of Rm,n, i-e. scenarios where at least one of the constituent 
distributions does not have a log-concave density, we extended the simulations reported 
in Section [6] to incorporate non- log- concave densities, see Section C in the appendix. 
Additional scenarios are given in Table [2l The conclusions from these simulations are 
that (1) Rm,n remains competitive even for moderate deviations from log-concavity for 
either / and/or g, and that (2) for srnall sam ples it even still outperforms the kernel- 
based estimator by iHall and HyndmanI (120031 ) . 



9 Conclusions 

We have presented a new approach to estimate ROC curves, which is based on plug- 
ging in distribution function estimates received from log-concave density estimates of 
the cases and controls data instead of their empirical counterparts. We propose boot- 
strap confidence intervals at a fixed t and show asymptotic equivalence between the 
empirical and the new ROC curve estimate if the log-concavity assumption holds. The 
performance of the new method is assessed via simulations and illustrated on a medical 
dataset. In what follows, we recapitulate the reasons why we consider our new approach 
useful. 

In applications, it seems sensible to assume that the true underlying ROC curve R is 
in fact smooth, thus we expect a better performance of an estimate by imposing some 
degree of smoothness. In addition, it is visually appealing to have a smooth estimate 
rather than a jagged function like Rm,n- 

Many parametric densities are in fact log-concave. As illustrated for the normal dis- 
tribution in Scenarios 1 to 3 of our simulation study, not much efficiency is lost when 
considering log-concavity instead of the actual parametric model if the underlying distri- 
butions are indeed Normal. Besides, assuming log-concavity is much more flexible and 
robust to misspeciflcation than imposing some parametric model, especially the widely 
used binomial model. 

In our simulation study we have illustrated that over a wide spectrum of distributions, 
the estimators Rm,n and R^ „ perform remarkably better than the empirical ROC curve 
for flnite sample sizes and have an efficiency that is comparable, and in some scenarios 
slightly better, than the kernel estimator Rrnn- Confidence intervals based on Rm,n have 
comparable covera ge probabilities but sl ightly shorter average interval lengths than the 



BTII interval from lZhou and Oid (120051 ). if the log-concavity assumption indeed holds. 
Taking into account these points, we advocate the use of our estimate since (1) we do 
not loose much, if anything at all, with respect to R^ „ or the binormal model and gain 
substantially with respect to the empirical ROC curve estimate but (2) are much more 
robust to misspecification than the binormal model. As opposed to kernel estimates, 
both our estimators Rm,n and Kj^ „ are fully automatic. 

It was shown that, if the underlying densities are log-concave, Rm,n is asymptotically 
equivalent to Rm,n, so our new estimator can be considered a "smoother" of the em- 
pirical ROC curve. This "smoothing property" of P„ has given rise to at least two 
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applications where empirical performance of a method cou ld be substantially irnproved 
by using P„ in place of P„ as an estimator of the CDF. In iMiiller and RufibachI ( 120091 ) 
it was demonstrated that using quantiles based on P* instead of order statistics reduces 
estimation variability to a much greater extent in comparison to the bias introduced 
in tail index estimation in extreme value theory. This leads t o a su bstantial reduction 
in mean squared error. Similarly, in iDiimbgen and RufibachI (120111 ) a simulation study 
demonstrates increased power in the comparison of two distribution functions when us- 
ing the Kolmogorov-Smirnov statistics based on the log-concave instead of the empirical 
distribution function estimate. 



10 Software and reproducibility 

Estimation of a univariate log-concave densit y and many additional related quantities 
is implemented in the R package logcondens (jPumbgen and RufibachI . 1201 ll ) available 
from CRAN. We have added the function logConROC to this package that gives the 
ROC curve estimates Rm,n and ^ as well as the corresponding area under the curve 
(AUC) in just one line of code. The bootstrap confidence intervals from Section [7| are 
implemented as the function conf IntBootLogConROC.tO in logcondens. Note that the 
active-set algorithm used to maximize the log-likelihood function for the log-concave 
density estimate is remarkably efficient so that it only takes seconds to compute Rm,n 
for rather large sample sizes m and n, e.g. of order m = n = 10^. For small to moderate 
sample sizes computation is immediate. The pancreas dataset used to illustrate our 
estimate is also included in l ogcondens and thus readily accessible. The code to com- 
pute the kernel estimator by iHall and HyndmanI ( 120031 ) is available from the author's 
webpage. All this enables straightforward reproducibility of t he plots and simula tions. 
In addition, the function smooth. roc in the R package pROC (IRobin et al.l . l201ll ) offers 
the option to fit a ROC curve based on lo g-concave den sity esti mates. 



This document w as created using Sweave (iLeischl. 120021) . I^T[7X( lKnuthl . Il984l : iLamport 



994f). R 2.14.0 (IR Development Core Teaml. |2010|) wit h the R packages logcondens 



( Diinabgen and RufibachI . l201lL iRufibach and Diimbgenl. 20 111 Version 2.0.6), xtable 



(iDahli l2009[ Version 1.6-0). and cacheSweave (IPenel . l2008l . Version 0.6) 



Appendix 

A Additional details on log-concave density estima- 
tion 



Here, we provide additional de tails and references on log-concave density estimation. 
Diimbgen and RufibachI (2009) show that the maximizer ipn of i is unique, piecewise 
linear on the interval [V(i), V(„)] with knots only at (some of the) observations V(j), and 
(fn = —oo elsewhere. Here V(i) < V(2) < ■ ■ ■ < V^n) are the ordered observations, 
and a "knot" of ipn is a location where this function changes slope. The MLEs ipn, 
Pn = exp(fin and Pn{t) = j^_^Pn{x) dx are consistent with certain rates of convergence. 
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see 



Diimbgen and RufibachI (120091 ) and lBalabdaoui et al.l (120091 ) as well as Theorem lA.ll 



below. 

The crucial feature of the estimate in our context is summarized in Theorem lA.ll 



Theorem A.l. iDumbgen and Rufibadi . \200yi . Theorems 4.1 and 4.4] Assume that the 
log-density (f = \ogp G 1-L^'^{T) for some exponent (3 E [1,2], some constant L > and 
a subinterval T = [A, B] of the interior of the support {p > 0}. Then, 

] and pn = \og{n)/n. In particular, this 



max 

teT(n,/3) 



where T{n, 13) := [A 
entails that for (3 e [1,2] 



1/(2/3+1) 
yn 



B 



1/(2/3+1)1 



max |P„(t) -P„(t)| 



Op[n 



-1/2^ 



where P„ is the empirical CDF of Vi, . . . ,Vn. 

Note that the result for ipn remains true when we replace 0n ~^^y Pn—p- Furthermore, 
it is wel l-known that the r ates of convergence in Theorem 1 A. 11 are optimal, even if [3 was 
known (IHas^minski]] . Il978[ ). Thus the log-concave (log-)density estimator adapts to the 
unknown smoothness of p in the range (3 G (1, 2]. 



B Estimation of the AUG 



In Section [6] we have shown that Rm,n and „ (and „) are valuable approaches 



to estimate the ROC curve. In this section, 



we will show that this holds true also 

Section 4.3.1) 



2003 



in estimation of the AUG. AUG is the most widely used (iPepe 
summary measure for ROG curves and typically re ported in the analys i s of a diagnostic 
test. First, note that Theorem 14.11 together with iHsieh and Turnbulll ( 119961 . Theorem 
2.3) implies that 



^/^{AUC - AUC) 



/ [R m,n 

{t)-R{t)) dt 



N{0, a 



AUc) 



SO that the AUG based on Rm,n and Rm,n share the same distributional limit (under the 
assumptions of Theorem 4.1) as well and asymptotic results for the empirical AUG are 
valid for the AUG based on the log-concave ROG . The expression for the a symptotic 
variance cx^uc empirical AUG is pro vided in iHsieh and Turnbulll (Il996[ Theorem 

2.3), see also the discussion in iPepd (l2003l . Section 5.2.5). 



C Estimation of Rm,n cind AUC under misspecifica- 
tion 

Here, we report on additional simulations when either / and/or g are misspecified, i.e. 
not log-concave. We have chosen the Lomax densities as these are an often-used and 
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Scenario 7: 20 Lomax(3, 7) vs. 20 Lomax(5, 3) 



Scenario 8: 100 Lomax(3, 7) vs. 100 Lomax(5, 3) 
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Scenario 9: 20 t(5, 0) vs. 20 1(5, 2) 
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Scenario 10: 100 1(5, 0) vs. 100 t(5, 2) 
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Scenario 11: 
20 N(0, 1) vs. 20 0.75N(2.5, 1) + 0.25N{2.5, 3) 



log- smoothed 
concave log-concave kernel binormal 



Scenario 12: 
100 N(0, 1) vs. 100 0.75N(2.5, 1) + 0.25N(2.5, 3) 
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Figure 6: Results of simulation study for misspecified models. Horizontal dashed lines 
at 0.5, 1, 1.5. Note the different scalings of the y-axis. 
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Table 2: Scenarios we simulated from to assess performance of the new estimator in 
misspecified models. 



Scenario 


F 


m 


G 




n 


7 


Lomax(3, 7) 


20 


Loinax(5, 3) 




20 


8 


Lomax(3, 7) 


100 


Loinax(5, 3) 




100 


9 


t(5, 0) 


20 


t(5, 2) 




20 


10 


t(5, 0) 


100 


t(5, 2) 




100 


11 


N(0, 1) 


20 


0.75-N(2.5, 1) - 


1- 0.25-N(2.5, 3) 


20 


12 


N(0, 1) 


100 


0.75-N(2.5, 1) - 


1- 0.25-N(2.5, 3) 


100 



unbiased model. The t- and the normal mixture densities were selected as they are still 
unimodal but have slightly "too heavy" tails to still be log-concave. 
The ASE relative to the empirical estimate f or the simulations from Tab le [2] are displayed 
in Figure [6l As expected, the estimator by iHall and HyndmanI (120031 ) now clearly out- 
performs Rm,n- However, the advantage of the latter estimator is only substantial for the 
Lomax scenarios whereas for the t and normal mixture setups the two estimators Rm,n 
and R^n perform comparably, even with some advantage for Rm,n- We thus conclude 
that Rm,n remains competitive even for moderate deviations from log-concavity in either 
F and/or G, and that f or sra all samples it still outperforms the kernel-based estimator 
bv lHall and Hvndmanl fcoOSh . 

As a matter of fact, if either F and/or G has a bimodal density it does not seem sensible 
to use Rm,n and we have thus omitted such scenarios from our misspecification simulation 
study. It seems safe to assume that in such a scenario, R^^ „ will outperform Rm,n and 
that the latter should not be used based on substantive arguments. 
As a final remark on misspecified models we would like to emphasize that one typically 
applies shape constraint estimates based on substantive knowledge, i.e. knows that the 
densities of F and G are log-concave, or one assesses this claim via formal testing, or 
via contrasting the log-concave to kernel density estimates. In addition, as it comes out 
of the simulations, Rm,n is somewhat robust against misspecification. 



D Proof of Theorem 14.1 



We write 



n sup 



R{t] IFm., G„) — R{t] Fm, Gn) 



nsup 
teJ 

n sup 
n sup 



(1 - G„(F„i(l - t))) - (1 - G^{F~\1 - t))) 
G4F-\t)) - G4¥-\t)) 



Gn{q 



< sup 

teJ 

=: Ti{n,m) +T2{n,m) 



.{q 



n\qi,t, 



n sup 



9n[qi,t + Orn{q2,t " qi,t) ) {q2,t - qi 
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where we defined qi^t = ^mit) and g2,f = F^^(t) and 9rr,. is some value in (0 , 1) for all 
m. By t he Bahadur-Kiefer the orem (see e.g. Ivan der Vaart and Wellnerl . Il996l . Example 
3.9.24 or Ivan der Vaartl . Il998l . Section 21.2) we can write for any t e J ¥z}(F(t)) = t + 



i¥^t)-F{t))+o, 



[n 



-1/2) 



what together with strong consistency of f ivan der Vaart 



19981 . p. 265) implies that for any e G (0, 6] we can find an uq large enough so that for 
all m,n > no almost surely 



F-\F{A + 6))-A-6-pn 



< 



where p„ = log(?7,)/?7,. But this implies that ¥^{F{A + S)) > A + pn and similarly 
F~-^(F(5 — S)) < B — Pn with probability one. Thus, 



Ti{n,m) = ^Jn sup ^^(gi^t) - G„(gi,t^ 
and the latter expression is (almost surely) not smaller than 



n sup 

tG[F(A+5),F(B-5)] 



n sup 

we[F™'oF(A+5),F„ioF{B-<5)] 



n sup 



Gr. 



u 



u 



0p(lj 



by virtue of the second statement of Theorem 4.4 in lDiimbeen and RufibachI (120091) . 



As for Tyj m, n) we first note q„, is bou nded with probability one f Pal et al. . 20071 Theo- 



rem 3 .2 and lCule and Samworthl . l2010l . Lemma 3). Then, using again lDlimbgen and Rufibach 
(I2OO9I . Theorem 4.4) one can show that under the assumptions C2 and C3 



sup|F^(t) -F(t)| 



< sup|F„(t) -F(t)| +0p 



n 



(3) 



what implies that y/n{Fm — F) converges weakly in D[J] (the space of cadlag functions 
on J) to the same limit as y/n{¥m — F), namely to BoF for a standard Brownian Bridge 
B. But convergence of the estimated distribution function i n D\J] implies convergence 
of the corresponding quantile process in -D[J], by virtue of Ivan der Vaart and Wellner 
( 1l996l . Lemma 3.9.23). Thus, for some generic constant C > 0, 



T2 (n, m) = y/n sup ^„ ( + 6.^ {q2,t - qi,t)) {q2,t - qi,t) 



< CVX{1 + Op(l))v^sup F-'{t) - F-i(t) 



teJ ^ 



oFiF-\t)) 



+ 



^;n\t)-F^\t) 



FiF-^t)) 



via 



fiF~m) 

van der Vaart and Wellnerl ( 1996 . Lemma 3.9.2 3). 



«p(l) 



□ 



For the empirical ROC cu rve this is carried out in iTang et al.l ( l2008l . Web supplement) 
and lTang and Zhoul (l2009l. Web supplement). 



Now, by Theorem 4.3 in lDiimbgen and RufibachI (120091 ) . again under the assumptions of 
Theorem 14. H we have that y/n{F — F) has the same limiting distribution as ^/n{¥ — F). 
This in turn implies that the above result carries over to Rm.n- 
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Comments on the assumptions of Theorem 14.11 As it can be inferred from the 
proof, the crucial assumption is not log-concavity of the underlying densities, but rather 
the Dvoretzky-Kiefer-Wolfowitz inequality ([3]) which must hold for the CDF estimate. 
Indeed, log-concavity is only one possible assumption that entails this property. The 
density whose quantile function is involved in the ROC curve estimate needs to have 
finite support, since it must fulfill the assumptions for the Bahadur-Kiefer theorem 



(Ivan der Vaartl . 119981 . Section 21.2). However, it is easy to see that any truncated log- 
concave density remains log-concave, so that this assumption does not seem to be too 
restrictive. 
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